Using statistical models to predict phrase boundaries for speech synthesis

نویسندگان

Eric Sanders

Paul Taylor

چکیده

This paper describes a variety of methods for inserting phrase boundaries in text. The methods work by ex amining the likelihood of a phrase break occurring in a sequence of three part-of-speech tags. The paper explains this basic technique and desribes more sophisticaed vari ations using distance probabilities.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions

This paper presents an automatic phrase boundary labeling method for speech synthesis database annotation using contextdependent hidden Markov models (CD-HMMs) and n-gram prior distributions. At training stage, CD-HMMs are built to describe the conditional distribution of acoustic features given phonetic label and phrase boundary. In addition, n-gram models are estimated to represent the prior ...

متن کامل

تعیین مرز و نوع عبارات نحوی در متون فارسی

Text tokenization is the process of tokenizing text to meaningful tokens such as words, phrases, sentences, etc. Tokenization of syntactical phrases named as chunking is an important preprocessing needed in many applications such as machine translation information retrieval, text to speech, etc. In this paper chunking of Farsi texts is done using statistical and learning methods and the grammat...

متن کامل

Acoustic Cues for Automatic Determination of Phrasing

This paper proposes a framework of automatic determination of phrasing using acoustic features derived from the speech signal. The feature vectors were defined in a series of analyses investigating the acoustic-phonetic realization of minor and major phrase boundaries and different boundary types. The resulting representation was used to train statistical classifiers to automatically determine ...

متن کامل

Modeling of sentence-medial pauses in bangla readout speech: occurrence and duration

Control of pause occurrence and duration is an important issue for text-to-speech synthesis systems. In text-readout speech, pauses occur unconditionally at sentence boundaries and with high probability at major syntactic boundaries such as clause boundaries, but more or less arbitrarily at minor syntactic boundaries. Pause duration tends to be longer at the end of a longer syntactic unit. A de...

متن کامل

Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling

This paper proposes an automatic prosodic labeling technique for constructing speech database used for speech synthesis. In the corpus-based Japanese speech synthesis, it is essential to use annotated speech data with prosodic information such as phrase boundaries and accent types. However, manual annotation is generally time-consuming and expensive. To overcome this problem, we propose an esti...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1995

Using statistical models to predict phrase boundaries for speech synthesis

نویسندگان

چکیده

منابع مشابه

Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions

تعیین مرز و نوع عبارات نحوی در متون فارسی

Acoustic Cues for Automatic Determination of Phrasing

Modeling of sentence-medial pauses in bangla readout speech: occurrence and duration

Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling

عنوان ژورنال:

اشتراک گذاری